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Suppose that G and H are finite, connected graphs, G regular, X 
is a lazy random walk on G and Z is a reversible ergodic Markov chain 
Q | on H. The generalized lamplighter chain X° associated with X and 

Z is the random walk on the wreath product H I G, the graph whose 
^0 vertices consist of pairs (f,x) where / = (fv) vS v(G) * s a labeling of 

the vertices of G by elements of H and a; is a vertex in G. In each 
step, X" moves from a configuration (/, a?) by updating x to y using 
the transition rule of X and then independently updating both f x 
i i and f y according to the transition probabilities on H; f z for z 7^ x, y 

remains unchanged. We estimate the mixing time of X a in terms of 
P I the parameters of H and G. Further, we show that the relaxation 

time of X° is the same order as the maximal expected hitting time 
of G plus \G\ times the relaxation time of the chain on H. 



1. Introduction. Suppose that G and H are finite connected graphs 
CSJ with vertices V(G), V(H) and edges E(G), E(H), respectively. We refer to G 

as the base and H as the lamp graph, respectively. Let X(G) = {/ : V(G) — > 

H} be the set of markings of V(G) by elements of H. The wreath product 

H I G is the graph whose vertices are pairs (/, x) where / = (fv) vt =v(G) e 
^ X{G) and x G V{G). There is an edge between (/, x) and (g, y) if and only 

if (x,y) e E(G), (f x ,g x ),(f y ,g y ) € E(H) and f z = g z for" all z $ {x,y}. 

Suppose that P and Q are transition matrices for Markov chains on G and 
^—i on H, respectively. The generalized lamplighter walk X (with respect to 

the transition matrices P and Q) is the Markov chain on H I G which moves 

from a configuration (/, x) by 

rS ~ 

1. picking y adjacent to x in G according to P, then 

2. updating each of the values of f x and f y independently according to 
Q on H. 
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The state of lamps f z at all other vertices z E G remain fixed. It is easy to 
see that if P and Q are irreducible, aperiodic and reversible with stationary 
distribution ttq and tth, respectively, then the unique stationary distribution 
of X° is the product measure 

n*((f,x))=n G (x)- J] n H (f v ), 
veV{G) 

and X° is itself reversible. In this article, we will be concerned with the 
special case that P is the transition matrix for the lazy random walk on G. 
In particular, P is given by 

,,,, „, s J I if x = y, 

( L1 ) P(x,y):=\\ Xe=mn \ 

for x, y G V(G) and where d(x) is the degree of x. We further assume that 
the transition matrix Q on H is irreducible and aperiodic. This and the 
assumption (1.1) guarantees that we avoid issues of periodicity. 




Fig 1. A typical state of the generalized lamplighter walk. Here H — Z4 and G = Z4, the 
red bullets on each copy of H represents the state of the lamps over each vertex v £ G and 
the walker is drawn as a red W bullet. 

1.1. Main Results. In order to state our general result, we first need to 
review some basic terminology from the theory of Markov chains. Let P be 
the transition kernel for a lazy random walk on a finite, connected graph G 
with stationary distribution ir. 
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The E-mixing time of P on G in total variation distance is given by 

(1.2) tmix(G,e) :=minJt>0: max \ V \P\x, y) - vr(y)| < e 

{ xeV(G) 2 ^ 

Throughout, we set t m ^(G) := t mix (G, |). 

The relaxation time of a reversible Markov Chain with transition matrix 
P is 

(1.3) t rel (G) := — !— 

1 — A2 

where A2 is the second largest eigenvalue of P. 
The maximal hitting time of P is 

(1.4) t ut (G):= max E,^], 

where denotes the first time i that X(t) = y and E x stands for the 
expectation under the law in which X(0) = x. The random cover time t CO v 
is the first time when all vertices have been visited by the walker X, and 
the cover time t cov (G) is 

(1.5) t CO v(G) := max E^Tcov]- 

The next needed concept is that of strong stationary times. 

Definition 1.1. A randomized stopping time r is called a strong sta- 
tionary time for the Markov chain Xt on G if 

P x [X T = y,T = t]=TT(y)P x [T = t}, 

that is, the position of the walk when it stops at r is independent of the value 
ofr. 

The adjective randomized means that the stopping time can depend on some 
extra randomness, not just purely the trajectories of the Markov chain, for 
a precise definition see [13, Section 6.2.2]. 

Definition 1.2. A state h(x) £ V(G) is called a halting state for a 
stopping time t and initial state x if {Xt = h(x)} implies {r < t}. 

Our main results are summarized in the following theorems: 
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Theorem 1.3. Let us assume that G and H are connected graphs with G 
regular and the Markov chain on H is ergodic and reversible. Then there exist 
universal constants c\ , C\ such that the relaxation time of the generalized 
lamplighter walk on H lG satisfies 

(L6) C1 - t hit (G) + \G\ tl . c m * c ^ 

Theorem 1.4. Assume that the conditions of Theorem 1.3 hold and 
further assume that the chain with transition matrix Q on H is lazy, i.e. 
Q(x,x) > \ Vx G H. Then there exist universal constants C2,C2 such that 
the mixing time of the generalized lamplighter walk on H lG satisfies 



[L " t mix {HlG) <C 2 (t cov (G) + \G\t mix (H. 



c 2 (tcov(G) + t rel (H)\G\ log |G| + \G\t mix (H)) < t mix (H I G), 

1 

w\- 

If further the Markov chain is such that 

(A) There is a strong stationary time th for the Markov chain on H which 
possesses a halting state h(x) for every initial starting point x G H , 

then the upper bound of 1.7 is sharp. 

Remark 1.5. The laziness assumption on the transition matrix Q on H 
is only used to get the term C2\G\t ra \ x {H) in (1.7). All the other bounds hold 
without the laziness assumption. 

Remark 1.6. // the Markov Chain on H is such that 

''mix 

(H,s) < tnuxCff, 1/4) + t Ie i(H) logs, 

then the upper bound matches the lower bound. This holds for many natural 
chains such as lazy random walk on hypercube Z!j, tori some walks on 
the permutation group S n (the random transpositions or random adjacent 
transpositions shuffle, and the top-to-random shuffle, for instance). 

Remark 1.7. Many examples where Assumption (A) holds are given in 
the thesis of Pak [ ], including the cycle Z n , the hypercube and more 
generally tori Z^,n,d G N and dihedral groups Z2 x Z n ,n G N are also 
obtained by the construction of strong stationary times with halting states 
on direct and semidirect product of groups. Further, Pak constructs strong 
stationary times possessing halting states for the random walk on k-sets of 
n-sets, i.e. on the group S n /(Sk x #„_&), and on subsets of n x n matrices 
over the full linear group, i.e. on GL(n,¥ q )/(GL(k,¥ q ) x GL(n — k,¥ q )). 
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1.2. Previous Work. The mixing time of Z2 I G was first studied by 
Haggstrom and Jonasson in [ ] in the case of G being the complete graph 
K n and the one-dimensional cycle Z n . Generalizing their results, Peres and 
Revelle [18, Theorem 1.2, 1.3] proved that there exists constants Cj, Cj de- 
pending on e such that for any transitive graph G, 

cit m (G)<t Te i{Z 2 lG)<C 1 t m {G), 

C2*cov(G) < t m ix(Z2 lG,£) < C2t cov (G). 

The vertex transitivity condition was dropped in [13, Theorem 19.1, 19.2]. 
These bounds match with Theorems 1.3 and 1.4 since H n = Z2 implies that 
the terms not containing H n in the denominator of (1.6) and in the bounds 
in (1.7) dominate. 

In [16], it is shown that t m ix(Z2 I G n ) ~ ^t cov (G n ) whenever (G n ) is a 
sequence of graphs satisfying some uniform local transience assumptions, 
including G n = Z^ with d > 3 fixed. 

Moving towards larger lamp spaces, if the base is the complete graph 
K n and \H n \ = o(n) one can determine the order of mixing time from 
[13, Theorem 20.7], since in this case the lamplighter chain is a product 
chain on YYi=± H n . Levi [14] investigated random walks on wreath products 
when H 7^ Z2. In particular, he determined the order of the mixing time of 
K n \ I K n , < A < 1, and he also had upper and lower bounds for the case 
HdlZ n , i.e. H is the ci-dimensional hypercube and the base is a cycle of length 
n, however, the bounds failed to match for general d and n. Further, Fill and 
Schoolfield [10] investigated the total variation and I2 mixing time of K n lS n , 
where the base graph is the Cayley graph of the symmetric group S n with 
transpositions chosen as the generator set, and the stationary distribution 
on K n is not necessarily uniform. 

The mixing time of H n = Z2 is closely related to the cover time of the 
base graph, and thus it helps understanding the geometric structure of the 
last visited points by random walk [4-6, 16]. Further, larger lamp graphs 
give more information on the local time structure of the base graph G. This 
relates our work to the literature on blanket time (when all the local times 
of vertices are within a constant factor of each other) [3, 8, 20]. 

1.3. Outline. The remainder of this article is structured as follows. In 
Section 3 we state a few necessary theorems and lemmas about the Dirich- 
let form, strong stationary times, different notions of distances and their 
relations. In Lemmas 3.3 and 3.5 we construct a crucial stopping time r° 
and a strong stationary time rf on H I G which we will use several times 
throughout the proofs later. Then we prove the main theorem about the 
relaxation time in Section 4, and the mixing time bounds in Section 5. 
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2. Notations. Throughout the paper, objects related to the base or the 
lamp graph will be indexed by G and H, respectively, and o always refers 
to an object related to the whole H I G. Unless misleading, G and H refers 
also to the vertex set of the graphs, i.e. v £ G means v £ V(G). P^,E^ 
denotes probability and expectation under the conditional law where the 
initial distribution of the Markov chain under investigation is [i. Similarly, 
P x is the law under which the chain starts at x. 

3. Preliminaries. In this section we collect the preliminary lemmas to 
be able to carry through the proofs quickly afterwards. The reader familiar 
with notions of strong stationary times, separation distance, and Dirichlet 
forms might want jump forward to Lemmas 3.3 and 3.5 immediately, and 
check the other lemmas here only when needed. 

The first lemma is a common useful tool to prove lower bounds for relax- 
ation times, by giving the variational characterization of the spectral gap. 
First we start with a definition. 

Let P be a reversible transition matrix with stationary distribution it 
on the state space Q, and let E„-[</>] := Ylyen^iv) 7 ^^)- The Dirichlet form 
associated to the pair (P, tt) is defined for functions 4> and r\ on f2 by 



The next lemma relates the spectral gap of the chain to the Dirichlet form 
(for a short proof see [2] or [13, Lemma 13.12]): 

Lemma 3.1 (Variational characterization of the spectral gap). The spec- 
tral gap 7 = 1 — A2 of a reversible Markov Chain satisfies 



A very useful object to prove the upper bound on t Ye \ and both bounds 
for t m i x is the concept of strong stationary times. Recall the definition from 
(1.1). It is not hard to see ([13, Lemma 6.9]) that this is equivalent to 



S(<f>, r,) := ((I - P)<f>, r,) n = £(I - P)4>(y)r,(y)ir{y). 



yen 

It is not hard to see [13, Lemma 13.11] that 



(3.1) E(4>) := £(4>, <(>) = ^ [(<KXi) - 4>(Xo)) 2 } 



(3.2) 



7 = 




where Va r?r = E^^ 2 ] - (E^]) 2 . 



(3.3) 



[X t = y,T<t] 



TT(y)P x [T <t}. 
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To be able to relate the tail of the strong stationary times to the mixing time 
of the graphs, we need another distance from stationary measure, called the 
separation distance: 



(3.4) s x (t) := max 

yen 



P\x,y) 
7r(y) 



The relation between the separation distance and any strong stationary time 
r is the following inequality from [2] or [7] or [13, Lemma 6.11]: 

(3.5) VxeQ: s x (t) < P x (t > t). 

Throughout the paper, we will need a slightly stronger result than (3.5). 
Namely, by [7, Remark 3.39] or from the proof of (3.5) in [13, Lemma 6.11] 
it follows that in (3.5) equality holds if r has a halting state h{x) for x. 
Unfortunately, we just point out that the [13, Remark 6.12] is not true 
and the statement can not be reversed: the state h(x,t) maximizing the 
separation distance at time t can also depend on t and thus the existence of 
a halting state is not necessarily needed to get equality in (3.5). 

On the other hand, one can always construct r such that (3.5) holds with 
equality for every x G Q. This is a key ingredient to our proofs, so we cite it 
as a Theorem (with adjusted notation to the present paper). 

Theorem 3.2. [Aldous, Diaconis][l, Proposition 3.2] Let (X t ,t > 0) be 
an irreducible aperiodic Markov chain on a finite state space f2 with initial 
state x and stationary distribution n, and let s x (t) be the separation distance 
defined as in (3.4). Then 

1. if t is a strong stationary time for Xt, then s x {t) < P x (t > t) for all 
t > 0. 

2. Conversely, there exists a strong stationary time r such that s x (t) = 
P x (t > t) holds with equality. 

Combining these, we will call a strong stationary time r separation opti- 
mal if it achieves equality in (3.5). Mind that every stopping time possessing 
halting states is separation optimal, but the reversed statement is not neces- 
sarily true. The next two lemmas, which we will use several times, construct 
two stopping times for the graph H I G. The first one will be used to lower 
bound the separation distance and the second one upper bounds it. 

We start with introducing the notation 

t 

(3.6) L v {t) = 2 H*i = «) - Sxo,v ~ S Xt ,v 

i=0 
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for the number of moves on the lamp graph H v , v G G by the walker up 
to time t. Slightly abusing terminology, we call it the local time at vertex 
veG. 

Let us further denote the random walk with transition matrix Q on H by 
Z. Since the moves on the different lamp graphs H v , v G G are taken inde- 
pendently given L v (t),v G G, we can define for each v G G an independent 
copy of the chain Z, denoted by Z v , running on H v . Thus, the position of 
the lamplighter chain at time t can be described as 

(F t ,X t ) = ((Z v (L v (t))) veG ,X t ) 

Below we will use copies of a strong stationary time tjj for each v G G, 
meaning that th(v) is defined in terms of Z v , and given the local times 
L v (t), th(v)-s are independent of each other. 

Lemma 3.3. Let th be any strong stationary time for the Markov chain 
on H. Take the conditionally independent copies of (th(v)) vgG given the 
local times L v (t), realized on the lampgraphs H v -s and define the stopping 
time t° for X° by 

(3.7) r° := inf {t : \/v G G : t h {v) < L v (t)} . 
Then, for any starting state (f ,xq) we have 

(3.8) [X? = (/,x),r<> = t] = HMfv) ■ P(/ ,,o) [*t = x,r* = t]. 

If further th has halting states then the vectors (h(f v (0)),y) are halting state 
vectors for t° and initial state (/ , xq) for every y £ G. 

We postpone the proof and continue with a corollary of the lemma: 

Corollary 3.4. Let th be a strong stationary time for the Markov 
chain on H which has a halting state h(z) for any z G H . Then define r° 
as in Lemma 3.3. Then for the separation distance on the lamplighter chain 
H I G the following lower bound holds: 

S {Lo , Xo) (t)>P {l(yXo) [T«>t]. 

Proof. Observe that reaching the halting state vector (h(f v (0)),x) im- 
plies the event r° < t so we have 
(3.9) 

P iLo , xo) [X? = (h(f v (0)),x)\ t P {Uxo) [Xf = (h(f v (0)),x),T» <t] 

7T G (X) I] Tff (/*(/„(()))) ' TTG(X) II (Hf v (0))) 
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Now pick a vertex x XQ ,t £ G which minimizes P [Xt = x X0) t\T° < t] /irG(x Xo ,t)- 
This quotient is less than 1 since both the numerator and the denominator 
are probability distributions on 67. Then, using this and Lemma 3.3, the 
right hand side of (3.9) equals 

Clearly the separation distance is larger than the left hand side of (3.9), and 
the proof of the claim follows. Note that the proof only works if th has a 
halting state and thus it is separation-optimal. □ 

Proof of Lemma 3.3. First we show that (3.8) holds using the con- 
ditional independence of th(v)-s given the number of moves L v (t) on the 
lamp graphs H(v),v € G. Clearly, conditioning on the trajectory of the 
walker {X±, . . . ,X t -±,X t = x} := X[l,t] contains the knowledge of L v (t)-s 
as well. We will omit to note the dependence of P on initial state (f Q ,xo) 
for notational convenience. The left hand side of condition (3.3) equals 

P [Xt = (f,x),r° < t] =J> [Xt = (/,x),T° < t\X [l>t] ] P [X M ] . 

x [l,t] 

Recall that Z v stands for the Markov chain on the lamp graph H v , and their 
conditional independence given L v (t)-s. Due to (3.3) and th being strong 
stationary for H we have for all v £ G that 

P[Z v (L v (t)) = f v ,r H (v) < L v {t)\X [lA ] = 7T H (fv) ■ P[th(v) < L v (t)\X [ht] }. 

Now we use that th(v)-s are conditionally independent given the local times 
to see that 

V[Xt = (f,x),T*<t\X [ltt] ] 

= P [Vv G G : Z v (L v (t)) = f v ,T H (v) < L v (t),X t = x, \X [1>t] ] 

Note that the second product gives exactly P [r° < t|Xr 1)t |], yielding 
(3.10) P [Xt = (/,x),r° < t] = l[n H (f v )J2 p K < t\X[i,t)] P[^[i,t]] 

v&G X M 

As Xt = x remains fixed over the summation, thus summing over all possible 
JT[l,t] trajectories yields 

P[Xt = (I, x),T <> <t}= Hw H (f v )P[X t = X,T°<t]. 

veG 
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To turn the inequality r° < t inside the probability to equality can be done 
the same way as in (3.3) and left to the reader. To see that the vector 
of halting states (h(f v (0)),y) is a halting state for r° for any y £ G is 
based on the simple fact that reaching the halting state vector (h(f v ) v& G,y) 
means that all the halting states h(f v ),v G G have been reached on all the 
lamp graphs H v ,v 6 G-s. Thus, by definition of the halting states, all the 
strong stationary times th{v) have happened. Then, by its definition, r° has 
happened as well. □ 

Recall the definition (3.7) of T d iamond. Then we can construct a strong 
stationary time for H I G, described in the next lemma. 

Lemma 3.5. Let t° be the stopping time defined as in Lemma 3.3, and 
let tq{x) be a strong stationary time for G starting from x G G and define 
r^by 

(3.11) rl :=t« + t g {X t «), 

where the chain is re-started at tq is started from {,F_ T i amon( i,X T d^ amon( f), 
run independently of the past and tq is measured in this walk. Then, t% is 
a strong stationary time for H lG. 

Proof of Lemma 3.5. The intuitive idea of the proof is based on the 
fact that tq is conditionally independent of r#-s and thus the lamp graphs 
stay stationary after reaching r°, and stationarity on G is reached by adding 
the term tg{X t o). The proof is not very difficult but it needs a delicate 
sequence of conditioning. To have shorter formulas, we write shortly P for 
P(y tXQ y First we condition on the events {r° = s,X° = (g,y)} and make 
use of (3.8) from Lemma 3.3. 
(3.12) 

P [Xt = (/, x), = t] = Yl P i X t = (/> = = 8, X: = (g, y)] 

s<t;(g,y) 

■ Y[n H (gv)-P[T* = 8,X a = y]. 
veG 

Now for the conditional probability inside the sum on the right hand side 
we have 

P [Xt = (/, x), r| = t\r* = s, Xt = (g, y)] 
= P [Xt = (/, x); r G {y) o9 s = t- S |t° = a, Xt = (g, y)] 
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where T G {y) S means the time-shift of T G (y) by s, and we also used that 
t g is only depending on y. We claim that 

U(g,y) [Xt- S = (l,x),T G (y) = t - s] J[ 7T H (g v )) 
9 \ v&G / 

= Py[Xt-s = x, T G (y) = t - s] Y[ KH(fv) 

= TT G (x)P y [T G = t - S] JJ TT H (f v ). 

The first equality holds true since r G (y) is independent of the lampgraphs 
and the transition rules of X° on H I G tells us that the lamp-chains stay 
stationary. We omit the details of the proof. The second equality is just 
the strong stationarity property of t g . Thus, using this and rearranging the 
order of terms on the right hand side of (3.12) we end up with 

FylTG = t - s]P[t = s,X s =y}- tt g (x) ]J n H (fv)- 

s<t,y£G «eG 

Then, realizing that the sum is just P[r° + tg(AV«) = t] finishes the proof. 

□ 



We continue with a lemma which relates the separation distance to the 
total variation distance: Let us define first 

(3.13) d x (t) ■= \\P\x, •) - 7t(-)||tv = \ £ \P\x, y) - n(y)\ . 

yen 

The total variation distance of the chain from stationarity is defined as: 

d(t) := max d x (t). 

The next lemma relates the total and the separation distance: 

Lemma 3.6. For any reversible Markov chain and any state i£(l, the 
separation distance from initial vertex x satisfies: 

(3.14) d x (t) < s x (t) 

(3.15) s x (2t) < 4d(t) 

Proof. For a short proof of (3.14) see [2] or [13, Lemma 6.13], and 
combine [13, Lemma 19.3] with a triangle inequality to conclude (3.15). □ 
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We will also make use of the following lemma: ([13, Corollary 12.6]) 
Lemma 3.7. For a reversible, irreducible and aperiodic Markov chain, 

lim d(t) 1 /' = A*, 

t— >oo 

with A* = max{|A| : A eigenvalue of P, A / 1}. 

The two fundamental steps to prove Lemma 3.7 are the inequalities stating 
that for all x E we have 

A* 

4(t) < **(*) < — , 
(3.16) 7r min 

|A 2 |* < 2d(t) 

with 7r m i n = min^gn n(y)- This inequality follows from [13, Equation (12.11), 
(12.13)]. 

We note that Lemma 3.6 implies that the assertion of Lemma 3.7 stays 
valid if we replace d(t) 1 /* by the separation distance s(t) 1 /*. 

4. Relaxation time bounds. 

4.1. Proof of the lower bound of Theorem 1.3. We prove c\ = 1/(16 log 2) 
in the lower bound of the statement of Theorem 1.3. First note that it is 
enough to prove that ihit(G) and \G\t Te \(H) are both lower bounds, hence 
their average is a lower bound as well. First we start showing the latter. 

Let us denote the second largest eigenvalue of Q by Xh and the cor- 
responding eigenfunction by tp. It is clear that ~E^ H (tp) = and we can 
normalize it such that Var 7rH (^/?) = E 7rH (^ 2 ) = 1 holds. Let us define 

<t>:V(HlG)^R, <ft((f,x)) = 

weG 

thus (ft is actually not depending on the position of the walker, only on the 
configuration of the lamps. Let X£ = (F t ,Xt) be the lamplighter chain with 
stationary initial distribution ir°. In the sequel we will calculate the Dirichlet 
form (3.1) for (ft at time t, first conditioning on the path X[0, t] of the walker: 

£ t [(ft] = \vA(<ft(X?) - <ft(X«)f) 
(4-1) I 

= -E 7r o(E 7r o[(0(X^) - <ft(X»)) 2 \X[0,t}]) 
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We remind the reader that in each step of the lamplighter walk, the state 
of the lamp graph H v is refreshed both at the departure and arrival site 
of the walker. Thus, knowing the trajectory of the walker implies that we 
also know L v (t), the number of steps made by the Markov chain Z v on H v . 
Moreover, the collection of random walks (Z v ) veG on the lamp graphs are 
independent given L v (t)-s. 

We can calculate the conditional expectation on the right hand side of 
(4.1) by using the argument above and the fact that E nH (ip) = as follows: 

(4.2) 

E^o [{ct>(Xt) - <f>(X«)) 2 \X[0,t}} =Y^n^(Z v (L v (t))-i;(Z v (0))) 2 \L v (t) 

Next, the product form of the stationary measure tt ensures that we can 
move to 7Tff inside the sum and condition on the starting state Z v (0): 



E, 



(^(Z v (L v (t)))-t/j(Z v (0))) 2 \L v (t) 

2E 7rH V 2 - 2E^ H ty(Z v (0))E Zv(0) [TP(Z v (L v (t))) \Z V (Q)]] , 



Since ip was chosen to be the second eigenfunction for Q, clearly 
E Zvi0) [ip(Z v (L v (t)))] = \ L E ' 
EwhIV' 2 ] = I; we arrive at 



Ez v (p)[ip {Zv(L v (t)))] = X H V ift(Z v (0)). Using the normalization 



(4.3) E^ [(ftXt) - W)) 2 l^[0, *]] = 2\G\ - 2 £ A^ (i) 

ueG 

Since X^eG L v (t) = 2i and the function X X H is convex, Jensen's inequality 
implies that 

J2*H {t) >\G\-\% lG] . 

veG 

Combining this with (4.3) and (4.1) and setting t := t* = \G\t Te \(H) = 
\G\/(1 — Xh) we arrive at 

Sttf) < \G\ (l - e 2 ^f ) < \G\ (1 - 2- 4 - 1 ) , 

where in the last step we assumed Xh > 1/2, since in this case we have 
(l-Atf) _1 logA# > -2 log 2. On the other hand, if X H < 1/2, than t Te i(H) < 
2 and we will use the other lower bound thit(G) which is at least of order 
Dividing by Vaiv>(/> = \G\, and using the variational characterization 
of the spectral gap in Lemma 3.1, we get that the spectral gap 7t* at time 
t* satisfies 

7f < 1 - 2~ 5 - 
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Since jt is by definition the spectral gap of the chain at time t, we have 



(4.4) 
Thus 



l-X 2 (HlGy < 1-2 



51og2>i*(l-A 2 (mG)), 

so we get a lower bound t re \(H I G) > 5l ^ s2 \G\t re \(H). 

To get the lower bound ihit(G)/4 we adjust the proof for — 1 lamps 
{H = Z2) [ , Theorem 19.1] to our setting. First pick a vertex w £ G which 
maximizes the expected hitting time E vrG (r u ,). As before, we will use the 
second eigenfunction ip with eigenvalue \h with E 7TH (tp) = 0,E 7rjJ (^ 2 ) = 1 
and define 

((/,*)) :=ip(U). 

Easy to see with the same conditioning argument we used in (4.2) and (4.3) 
that the Dirichlet form at time t equals 



£t(<f>) = 1 - E^c 



L w {t) 
H 



L w (t) 
H 



> 1/4. To see this we first note that for 



Now we will show that E^ 
any t we have for the hitting time t w of w G G 

E„(r m ) < t + t hit P v [r w > t] 
E 7rG (r w ) < t + thitP7r G [T«i > t] 

To see the first line: either the walk hits w before time t, or the expected 
additional time it takes to arrive at w is bounded by thit regardless of where 
it is at time t. The second line follows by averaging over ttq- 

Next, [13, Lemma 10.2] states that ihit < 2 max„ E^-fr^] holds for ev- 
ery irreducible Markov chain. We exactly picked w such that it maximizes 
E 1G (r«), so we have ihit < 2E 7rG [r ! „], so multiplying the previous displayed 
inequality by 2 gives 

ihit < 2t + 2thitP7r G [T«, > t] 
Now substituting t = thit/4 and rearranging terms results in 



7T G 



*hit 



> 



Since {L w (t hit /4) 
the upper bound 



0} = {t w > thit/4}, we can use this inequality to obtain 



£ W4 (0) = 1-E, 



■M*hit/4) 
H 



< l-P na [T w > thit/4] < 1 



3 
4" 
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Analogous to the last lines of the proof of the lower bound above, (see (4.4)) 
we obtain the other desired lower bound: 

t Te i(H) > -t hit (G). 
v ; ~ 2 log 2 4 v 1 

Putting together the two bounds we get 

t Iel (H l G) > max { — *— t m (G), -^—\G\t rcl (H) 
{ 8 log 2 5 log 2 

> Y(yLg2 {th[t{G) + I^IWW) • 

4.2. Proof of the upper bound of Theorem (1.3). To prove the upper 
bound, we will estimate the tail behavior of the strong stationary time T2 
in Lemma 3.5, relate it to s°(i), the separation distance on H I G, and then 
use Lemmas 3.7 and 3.6 to see that ^(i) 1 /* ->■ A°. We will use separation- 
optimal th and tq in the construction of . The existence is guaranteed by 
Theorem 3.2. We will use P for P(/ i2: ) f° r notational convenience. Combining 
(3.5) and the fact that r° happens when all the stopping times th(v),v £ G 
have happened on the lamp graphs, by union bound we have for any choice 
of < a < 1 

sy tX) (t) < P (Lx) [r 2 ° > t] < P m [r° > at] + P (£)2) [t» > t\r" < at] 

(4.5) < P[r cov >at/3] 

(4.6) + P 3w £ G : L w (at) < ^\t cov < at/3 

(4.7) +P 



at 

3w £ G : t h (w) > L w (at) Nv G G : L v (at) > 



2\G\\ 



(4.8) + maxP faiy) [t g > (1 - a)t] 

(9,y) 



Namely, there are four possibilities: The first option is that there is a state 
w E G which is not hit yet, i.e. the cover time of the chain is greater 
than at/3: giving the term (4.5). The constant 1/3 could have been chosen 
differently, we picked at/ 3 such that the remaining 2at/3 time still should 
be enough to gain large enough local time on the vertices v G G. Secondly, 
even though any state w on the graph G is reached before time at/3, the 
remaining time was not enough to have at least at/2\G\ many moves on 
some lamp graph H(w), term (4.6). The third option is that even though 
there have been many moves on all the lamp graphs, there is a vertex w £ G 
where th{w) has not happened yet, yielding the term (4.7). We will handle 
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the three terms separately. The fourth term handles the case where the 
strong stationary time tq is too large. (For convenience, we will write t 
instead of at in estimating the first three formulas.) 
We can estimate the first term (4.5) by a union bound: 

_ log2 t 

(4.9) P[r cov > t/3] < P[3w : t w > t/3] < \G\2e 6 t ut ; 

where £hit is the maximal hitting time of the graph G, see (1.4). To see this, 
use Markov's inequality on the hitting time of w € G to obtain that for all 
starting states v G G we have P v [r w > 2thit] < 1/2, and then run the chain 
in blocks of 2thit- I n each block we hit w with probability at least 1/2, so we 
have 

P v [t w > K(2t hit )} < ^. 

To get it for general t, we can move from [V^WtJ to t/thit by adding an extra 
factor of 2, and (4.9) immediately follows by a union bound. 

For the third term (4.7) we claim the following upper bound holds: 
(4.10) 



1 



t 



3w:r H (w)>L w (t)\yv:L v (t)>^ m < \G\ — e 2|G|wW. 

To see this we estimate the probability of the event {th(w) > L w (t) \L w (t) > 
2^j\} on a single lamp graph and then use a union bound to lose a factor \G\ 
and arrive at the right hand side. First note that according to Lemma 3.7, 
the tail of the strong stationary time th is driven by A^. More precisely, 
using the inequality (3.16) we have that for any initial state h G H: 



t h {w) > 



2\G\ 



S Sh tTTJT S 7~F7V A 



1 f (l-A^)t 

— 77r\ ex P 



7r min (H) " I 2\G\ 



Since we have made at least L w (t) > steps on each coordinate, the claim 
(4.10) follows. The fourth term (4.8) can be handled analogously and yields 
an error probability exp{— ct/t Te i(G)} which then, taking the power of 1/t 
and limit as in Lemma 3.7, will lead to a term of order t re \(G). Then, taking 
into account that t re \(G) < ct m ; x (G) < Ct^ lt {G) holds for any lazy reversible 
chain (see e.g. [13, Chapter 11.6,12.4]), we can ignore this term. 

The intuition behind the estimates below for the second term (4.6) is that 
since the total time was at least 2t/3 after hitting, regularity of G implies 
that the average number of moves on a lamp graph equals 4t/(3|G|) by the 
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double refreshment at any visit to the vertex. Thus, the probability of having 
less than t/{2\G\) moves must be small. 

More precisely, we introduce the excursion- lengths to a vertex w G G: Let 
us define for all w G G the first return time to state w as 

R(w) = inf{t > : X t = w\X = w}. 

The strong Markov property implies that the length of the i-th excursion 
Ri(w), defined as the time spent between the (i — l)th and ith visit to w, 
are i.i.d random variables distributed as the first return time R(w). 

Thus, having not enough local time on some site w G G can be expressed 
in terms of the excursion lengths Ri(w)-s as follows: 



(4.11) P 



3w : L w (t) < 2|^| | 



< 



< \G\ maxP t 

«/6G 



t/A\G\ 



2t 



i=l 



since conditioning on hitting before i/3 ensures that we had at least 2t/3 
steps to gain the £/4|£r| visits to w, and by the definition (3.6) of L v (t), this 
guarantees that L w (t) < t/2\G\. 

We aim to estimate the right hand side of (4.11) using the moment gen- 
erating function of the first return time R(w). To be able to carry out the 
estimates we need a bound on the tail behavior of the return times. A very 
similar argument can be used to the one we used for the tail of the cover 
time (4.9), namely the following holds: 



[R(w) > 2t m + 1] = P 

< 



[Xi^to]E[P Xl (r w > 2i hit |X 1 )] 
< 



E[E Xl (^)] . 1 



Running the chains in blocks of 2ihit + 1) one can see that in each block 
the chain has a chance at least 1/2 to return to w, so we have for each 
t > 2t U t + 1 



(4.12) 



P [Rw > t] < 2 



1 \ 2t h 



3exp 



log 2 t 

2 ihit 



where the factor 3 comes from ignoring to take the integer part of t / i hit an d 
neglecting the +1 term in the denominator. 

We can use this tail behavior to estimate the moment generating function 



E 



< e mo\ + ^[ e PR^\{R w > 2 |G|}] 
P[e^ > z]dz 



e W\G\ + 



2d\G\ 
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where we cut the expectation at 2\G\. Using the bounds in (4.12) yields: 



E 



< e W\G\ + 



OO j 

P[R W > -\ogz]dz 

ma\ p 

log 2 



poo log^ 

< e 2/3|G| +2 / z'V^dz. 



Setting arbitrary f3 < log 2/(2t^ t ) makes the second term integrable, and 
with the special choice of /3 = we obtain the following estimate: 



(4.13) 



E 



< e W\G\ + 2e -2/3|C| < e (2+6)/3\G\ 



with an appropriately chosen < 5 < 1/3. Now we apply Markov's in- 
equality to the function e^^=i R i( w ) to estimate the right hand side of 
(4.11): 



(4.14) 



t/4,\G\ 

Ri(w) > 2t/3 

-i=i 



< e"3 



2 



E 



4\G\ 



where we also used the independence of the excursions Ri(w)-s. Using the 
estimate in (4.13) to bound the right hand side we gain that 



(4.15) 



t/4\G\ 

RiH > 2t/3 

1=1 



l-<5 



< e" 



fit 



exp 



(1 - 5) log 2 t 
24 t^~ t 



where we used (3 = log 2/(4^), and modified S := 35/2 < 1/2. Using the 
relation of the local time to the excursion lengths in (4.11) we finally get 
that the second term (4.6) is bounded from above by 

log 2 t 



(4.16) 



3w : L w (t) < 



2\G\ 



^cov ^ t/3 



< |G|exp 



48 tut 



Mind that all the estimates (4.9), (4.10) and (4.16) were independent of the 
initial state (f,x) G H I G, so using the second inequality in (3.16) and 
maximizing over all possible initial states yields us 
(4.17) 



|A 2 |* < 2(f(t) < 2s°(t) < 



4 exp 



4|G| 



vr min (ii) 

(1-5) log 2 t 
24 



2\G\t rcl (H) 
\ +4|G|exp 



log 2 t 
6 thit 
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In the final step we apply Lemma 3.7: we take the power 1/t and limit as t 
tends to infinity with fixed graph sizes \G\ and \H\ on the right hand side of 
(4.2) to get an upper bound on A2. Then we use that (1 — e~ x ) < x + o(x) 
for small x and obtain the bound on t re \ finally: 

f 48 
t Iel (H I G) < max <^ 2|G|t rel (#), ^*Ut 



log 2 

This finishes the proof of the upper bound on the relaxation time. 

5. Mixing time bounds. Based on the fact that H has a separation- 
optimal strong stationary time tjj, the idea of the proofs is to relate the 
separation distance to the tail behavior of the stopping times r° and r| 
constructed in Lemmas 3.3 and 3.5, respectively. Then these estimates are 
turned into bounds of the total variation distance using the relations in 
Lemma 3.6. This method gives us the upper bound in (1.7) and the cor- 
responding lower bound under the assumption (A). For the lower bound 
without the assumption, we will need slightly different methods. 

5.1. Proof of the upper bound of Theorem 1.4- The idea of the proof is 
to use appropriate top quantiles of the strong stationary time tjj on H, and 
give an upper bound on the tail of the strong stationary time r| defined in 
Lemma 3.5. Throughout, we (only) need that tjj and tq in the construction 
of r 2 are separation-optimal. The existence is guaranteed by Theorem 3.2. 
(Thus, tjj does not necessarily possess halting states.) 

Let us denote the worst-case initial state top e-quantile of a stopping time 
r as 

(5.1) ant (r) := maxinf{t : Pjr > t] < e} 

yen 

We continue with the definition of the blanket time: 

(5.2) B2:=in{ L VjWeG: b^ < 2 ) . 

Let us further denote 

(5.3) B 2 :=maxE„(i3 2 ) 

It is known from [8] that there exist universal constants C and C such that 
C't cov < B 2 < Ct cov . 

Thus, our first goal is to show that at time 

8B 2 + \G\t?™; Gl (r H ) + t*ff (t G ) =: 8B 2 + \G\t u H + t G =: f 
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we have for any starting state (/, x) that 



(5-4) P^jlTf >*»]<- 



We remind the reader that r| = t° + tq {X t o ) and thus the following union 
bound holds: 

P [r 2 ° > e\ < P [B 2 > 8B 2 ] + P [r° > |G|ift + 8B 2 |i3 2 < 8B 2 ] 
(5 ' 5) +maxP„ [r G > t G |6 2 < 8B 2 ,r* < 8B 2 + |G|t&] , 

where in the third term we mean that we restart the chain after time 8B 2 + 
and measure tq starting from there. The first term on the right hand 
side is less than 1/8 by Markov's inequality, the third is less than 1/16 by 
the definition of the worst case quantile. The second term can be handled 
by conditioning on the local time sequence of vertices and on the blanket 
time: (for shorter notation we introduce t\ := \G\tjj + 8B 2 ) 
(5.6) 

P[r° > |G|^ + 8B 2 |£ 2 <8B 2 ] 

= Y, ^[^■■{rH(w)>L w (t 1 )}\(L v (t 1 )) v ,B 2 = s)-P[(L v (t 1 )) v ,B 2 = s] 

s<8B 2 ,{L v (ti)) v 

The fact that i3 2 < 8B 2 means that the number of visits to every vertex 
wGG must be greater than half of the average, which is at least \t\. Since 
L w (t) is twice the number of visits by (3.6), {th{w) > L w (ti)} C {th(w) > 
By the definition of the quantiles, 

Ph[T H (w)>t U H }< 



16|G| 



holds for every h £ H and w € G. Applying a simple union bound on the 
conditional probability on the right hand side of (5.6) yields 

P^) [r» > h\B 2 < 8B 2 ] < Yl leTcr) P ^ L ^v^ 2 = s] 

s<8B,(L v (ti))) 1 l/ 

1 

where we used that the sum of the probabilities on the right hand side is 
at most 1. Combining these estimates with (5.5) yields (5.4). It remains 
to relate the worst-case quantiles to the total variation mixing times. Here 
we will make use of the separation-optimal property of th and tq. Now 
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just consider the walk on G. Let us start the walker on G from an initial 
state xo G G for which the maximum is attained in the definition (5.1) of 
the quantile ^^(tg)- Then, by (3.15) we have that one step before the 
quantile we have 



16 ~ 



tg > tytfiTc) ~ 1 = s Xo (r G ) - l) 

<^Q(^iT(^)-i)). 

This immediately implies that ^{ty^ (tg) ~~ ■"■) — ^ mix m) ■ ^ ^he sub- 
multiplicative property of the total variation distance d(kt) < 2 k d(t) k we 
have that i m i x (G, ^) < 6i m j x (G, |). So we arrive at 

(5.7) t^f(r G )-l<12t mix (G) 

Similarly, starting all the lamps from the position ho where the maximum is 

.quant / 
C 1/16|G| ( 



attained in the definition of = tf^, G > (t~h), one step before the quantile 



we have 

1 



< P ho [t h >e H -l] = s ho (t u H - 1) < Ad{(f H - l)/2) 



16|G| 
So we have 

(5-8) te |W-l)<u(ff (ii 4 



2 V "l/16|G| V'-H^ V — " mlx V ' 64|G| 

On the other hand, on the whole lamplighter chain H I G we need the 
other direction: For every starting state (f,x) (3.14) and (5.4) implies that 

%r)(0 < a m (f) < P { f, x) [r 2 * > f] < 1/4 

Maximizing over all states (/, x) yields 

(5.9) t mix (H I G) < f. 

Putting the estimates in (5.7) and (5.8) to (5.9), we get that 

t m ix{H I G) < t < 8B 2 (G) + 12i mix (G) + 1 + 2\G\ t 

mix 



64|G|y 2 

Since B 2 (G) < Ct cov (G), and t mix (G) < 2t hit (G) < 2t cov (G) for any G (see 
for instance [13]), the assertion of Theorem 1.4 follows with G 2 = 8(G + 3), 
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where C is the universal constant relating the blanket time B2 to the cover 
time t cov in [8]. 

We remark why we did not make the constant C2 explicit: If the blanket 
time B2 were not used in our estimates, the error probability that some 
vertex w £ G does not have enough local time would need to be added. 
This, similarly to the term (4.6) behaves like \G\e~ c ( tcov+ ^ tm ^ H '^ /thit . 
If we do not assume anything about the relation of thit(G9 and t cov (G) 
and on t m i x (H, ^), then this error term will not necessarily be small. For 
example, if G n is a cycle of length n, H n is a sequence of expander graphs, 
then t cov (G n ) = t hit (G n ) = G(n 2 ), and t mix (H,^) = \og\H\ • log |C| = 
log \H\ log re, and we see that the term is not small if log \H\ = o{n/ logn). 

5.2. Proof of the lower bounds of Theorem 1.4- As we did with the re- 
laxation time, it is enough to prove that all the bounds are lower bounds 
separately, then take an average. First we start showing that the upper 
bound is sharp in 1.7 under the assumption that there is a strong stationary 
time th with halting states. 

5.2.1. Lower bound under Assumption (A). We first aim to show that 
c \G\t mbL (H,j^)<t mix (HlG). 

Consider the stopping time r° constructed in Lemma 3.3. Corollary 3.4 
tells us that the tail of r° lower bounds the separation distance at time t. 
We again emphasize that this bound holds only if th in the construction of 
r° is not only separation optimal but it also has a halting state. Our first 
goal is to lower bound the tail of r <> ', then relate it to the total variation 
distance. 

First set 

(5.10) 4 := ^ /2/2 (t h ) - 1, f:=\\G\t^ 

clearly this time is nontrivial if t^^ 1/2 / 2 (Tff) 7^ 1- We handle the case if it 
equals 1 later. We can estimate the upper tail of t° by conditioning on the 
number of moves on the lamp graphs H v , v £ G: 
(5.11) 

P [r° >f]>P[3»eG: t h (w) > L w (t°)] 

> Yl P [^w e G : t h (w) > L w (f)\(L v {e)) v ] P [{L v (f)) v ] 

(L v (t°)) v 

For each sequence (L v (t°)) ve G we define the random set 
S (Lv)v :={w£G: L w (f) < 4} 
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Since X™-^^) = 2i° = ^|G|^, we have that for arbitrary local time 
configuration (L v (t°)) v , 

(5-12) |^)J>|G|/2. 

Thus we can lower bound (5.11) by restricting the event only to those w G G 
coordinates which belong to this set, i.e. whose local time is small: 
(5.13) 

P [t* >f}> £ P [3w G S {Lv)v : t h (w) > L w (f)\(L v (f)) v ] P [(MO)J 

> ^ p[3^G5 ( ^:r i? H>4|(^(0)JP[(^(*°))J ; 

(U(t»))„ 

where in the second line we used that for w G Sr L \ v we have {r#(ii;) > 
Lw^)} D {th(w) > t^-}. Conditioned on the sequence the times 

th{w) for it) G S(l\ are independent. On each lamp graph H(v) let us 
pick the starting state to be ho G H where the maximum is attained in the 
definition of t^^y^.^Tn)- Since tjj is one step before the quantile, we have 

(5-14) P ho [t h (w) > t^p /2 (r H ) - l] > |GT 1/2 /2. 

We need to start the lamp-chains from the worst-case scenario ho G H for 
two reasons: First, we needed to define the quantile as in (5.1) to be able to 
relate it to the total variation mixing time on H, see below. Then, the fact 
that ^ uant was defined as the worst-case starting state quantile means that 
for other starting states the quantile may be smaller, and the lower bound 
can possibly fail. 

Combining (5.14) with (5.12) and the conditional independence gives us 
the following stochastic domination from below to the event in (5.13) 

3w G S (Lv)v : r H (w) > t* H \{L w (f)) w ] > P[V > 0], 

where V is a Binomial random variable with parameters (|G|/2, \G\~ l l 2 /2}. 
Clearly, for |G| > 8 > 16(log2) 2 we have 

I ^\G\/2 | G |i/2 

21" 



P[^>0] = l-(1- W7 ,j >l-e 



Combining this with (5.13) and summing over all possible (L v (t°)) V £G se- 
quences we easily get that 

[G|V2 

Pfr >t°l > l_ e — 
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Then, by Corollary 3.4 we have 



In the next few steps we relate the tail of r° and tjj to the mixing time of 
the graphs. First, combining the previous inequality with (3.15) implies that 
for the starting state (h ,x) the following inequalities hold: 

l-e-l G l 1/2 /^< s ^ ) (tO)<4 ( f(t72). 

These immediately imply 

(5.15) t mix (HlG,±)> V = ~|G|4 

Now we will relate t^ H = ^f^ 1/2 , 2 (t#-) — 1 to the mixing time on H. Since 

tfj investigates the worst case initial-state scenario, by inequality (3.5) for 
any starting state h E H we have 

s h (t H + 1) < Ph [th >t H + l}< \G\- l ' 2 /2 

Using dh(t) < Sh(t) (see Lemma 3.6) and maximizing over all h G H we get 
that 

(5.16) d H (t H + 1) < \G\-^ 2 /2. 

On the other hand, the total variation distance for any Markov chain has 
the following sub-multiplicative property for any integer k, see [13, Section 
4.5]: 

(5.17) d(kt) < 2 k d(t) k . 

Taking t = t h + 1 and combining with (5.16) we have that 

d H (2(t H + 1)) < 4d H (t H + l) 2 < 4- 1 



4|G|' 

which immediately implies 

'-mix 

Combining this with (5.15) yields the desired lower bound: 

(tmix (H, rk ) - 2) < t m UH I G, I). 



16 11 V mix V ' l G l 
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Mind that the term —2 in the brackets can be dropped when picking a 
possibly smaller constant and take the graph large enough. The case when 
/2^. t h) = 1 can be handled the following way: first mind that we 
can exchange the quantile for arbitrary < a < 1, and look at the proof 
with t jgf^o j2 i T n) ■ If this is still = 1 for all a, that means that th = 1 a.s. 
In this case, it is enough to hit the vertices to mix immediately and thus 
the mixing time \G\t m i x (H) is of smaller order than the cover time t cov (G). 
The case when \G\ < 8 but \H\ — > oo is easy to see since in this case 
t m ix{H, pr) < 2t m i y (H) and one can argue that mixing on H I G requires 
mixing on a single lamp graph H w for a fixed uu G 67. Thus the lower bound 
remains valid. 

The cover time of G is already a lower bound for the — 1 lamps case 
by [18], hence also for general lamps, but, for completeness, we adjust the 
proof in [ , Theorem 19.2] to our setting. By Lemma 3.3 we can estimate 
the separation distance on H I G as 

s y >x) (t)>p m [T»>t] 

(5.18) >P iLx) [3weG:r H {w)> L w {t)} 

> P { f >x) [3w G 6? : L w {t) = 0] = P^ )X) [r cov > t] . 

Now, using the submultiplicativity of d(t) in (5.17) and the relation of the 
separation distance and the total variation distance in (3.15), we have that 
at time 8t mix (H I G, 1/4): 

sy iX) {8t mix (H I G, D) < 4cf {4t mix (H I G, |)) < 4^ < \ 

Combining with (5.18) yields that for every starting state we have 

P(/,aO [rcov > 8t mix (H I G, 1/4)] < 1/4. 

Thus, run the chain in blocks of 8i m ix(-ff I G, 1/4) and conclude that in 
each block it covers with probability at least 3/4. Thus, the cover time is 
dominated by 8i m i x (-ff I G, 1/4) times a geometric random variable with 
success probability 3/4, so we have 

E(/ iX) [r cov ] < llt mix (HlG,l/4). 

Maximizing the left hand side over all possible starting states yields t cov (G) < 
Himix(-£f I G, 1/4), finishing the proof. 
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5.2.2. Proof of the lower bound of Theorem 1-4, without assumption (A). 
Now we turn to the general case and first show that c i re i(-ff)|Cr| log \G\ 
is a lower bound. No laziness assumption on the chain on H is needed to 
get this bound. We will use a distinguishing function method. Namely, take 
an eigenfunction <j>2 of the transition matrix Q on H corresponding to the 
second eigenvalue Xh • Then let us define ip : H I G — » C: 



(5.19) 



One can always normalize such that 



E^VO = J^[0 2 ] = Vaiv>(V0 = Yl Var -(^) = |G| • 1 



This normalization has two useful consequences: First, by Chebyshev's in- 
equality, the set A = {ip < 2|G| 1//2 } has measure at least 3/4 under station- 
arity. Second, (^2(50) := max se g 4>2{g) > 1, otherwise the variance would be 
less than 1. We aim to show that the set A has measure less then 1/2 at 
time ct re \(H)\G\ log |G| and then we are done by using the following charac- 
terization of the total variation distance, see [2, 13]: 



MllTV 



sup {v{A) 

Acn 



H{A)}. 



Let us start all the lamp graphs from go £ H where the maximum is 
attained for <p2- Then we can condition on the local time sequence and use 
the eigenvalue property of </>2 to obtain 



E 



(5.20) 



E 



m(F t ,X t ))]=B 

= 2 (5o)E 



w&G 



E4' 

wdG 



L w (t) 



Since L v (t) = 2t, we can apply Jensen's inequality on the function y \-t 
\ V H to get a lower bound on the expectation: 



(5.21) 



E, 



.w&G 



L w (t) 



2t 
\G\ 



> \G\\™ =\G\[1 



tro}(H) 



2t 
\G\ 



By giving a lower bound on the right hand side we must assume here that 
Xh > 0, or equivalently t Te [(H) > C > 1. Thus, first we handle the other 
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case, i.e. when t ve \(H) < 2. Then the lower bound we are about to show is 
of order |G7| log | C| which is always at most the order of t cov (G), due to a 
result by Feige [9] stating that for simple random walk on any connected 
graph G, t cov (G) > (1 + o(l))|G| log 

When t rc \(H) > 2, we can use that 1—x > e~ l ^ x when < x < 0.5 to get a 
lower bound on the right hand side of (5.21). Then set t = ct ie \(H)\G\ log \G\ 
turning the estimate in (5.20) into 

E (£o ,,) [ip((F t , X t ))] > |67| 1_3c 02(<?o)- 

We can easily upper bound the conditional variance as follows: 

Var[MM*)W] < B 9o [4>l{F w {t))\L w (t)} < \G\4(g ) . 
weG 

Now, let us estimate the measure of set A at time t by using the lower bound 
on the expectation: 



A<2\G\ 1/2 <V( 9 _ , X ) |V*-E(^)|>^(5o)|G'| 1 - 3c -2|G| 1 / 2 



Now we use that 02 (go) > 1 an d if c < 1/6 then on the right hand side, 
the term 02 (50) | G| 1— 3c dominates, so for |G7| large enough we can drop the 
negative term and compensate it with a multiplicative factor of 1/2, say. 
Thus, condition on the local time sequence first and see that for any sequence 
(L v (t)) veG Chebyshev's inequality yields: 

Combining this with the estimate on the conditional variance above yields 
that 

P (, ,z) i^€A\ (L w (t))J < 

This bound is independent of the local time sequence, so the law of total 
probability says we have the same upper bound without conditioning on the 
local times. Now setting c < 1/6 an |G| large enough we see that the right 
hand side can be made smaller than 1/2, finishing the proof. 

To see that the cover time is a lower bound in the general case, couple 
the chain on H I G to Z2 I 67, i.e. jump to stationary distribution on H v 
once the walker on the base hits vertex v and use [18] or [13] to see that 

tcov{G) < t m ix(^2 I G) < t mlx (H I G). 

Next we show that c|G|t m ; x (-fT) is a lower bound if the chain on H is lazy. 
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Let us start with a definition for general Markov chain lond 
istop(G) := maxmin {E[r]; r stopping time s.t. Pj;[I T = y] = ir(y) Vy G fl} . 

We call a stopping time mean-optimal if E[r] = t s t op (G). Lovasz and Winkler 
[15] show that optimal stopping rules always exist for irreducible Markov 
chains. We aim to show that 

^\G\-t stop (H)<t stop (HlG). 

Take a mean optimal stopping time t* on H I G reaching minimal ex- 
pectation, i.e. ~Etf* tX *\[r*] = t s t op (H I G) for some (f*,x*) G H I G and 
E (ZjX) [t*] < t stop (Hl G) for (f,x) ± (f,x*). 

We use this r* to define a stopping rule th{v) on H v , for every v G G. 
Namely, do the following: look at a coordinate v G G and at the chain 
restricted to the lamp graph H v , i.e. only the moves which are done on the 
coordinate H v . Then, stop the chain on H v when t* stops on the whole 
HlG. 

Start the chain from any (J^,xq). Since YlveG = 2t, we have 

Take the vertex w £ G (which can depend on xq), which minimizes the 
expectation Ej v ^[th(w)]. Clearly for this vertex the expected value must 
be less than the average: 

E f0 [r H } < ^E iL>X0) [r*] 

The left hand side is at least as large as what a mean-optimal stopping rule 
on H can achieve, and the right hand side is at most ^t stop (H I G). Thus 
we arrive at 

^|G|t st op(i?) < t stop (HlG). 

In the last step we use the equivalence from the paper [19, Corollary 2.5] 
stating that t st0 p and i m i x are equivalent up to universal constants for lazy 
reversible chains and get that 



.veG 



2E 



(/o,*o)L r 



Cl \G\t mix (H)<t mix (HlG). 
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6. Further directions. The next step of understanding generalized 
lamplighters walks might be to investigate which properties on G and H are 
needed to exhibit cutoff (for a definition see [2, 13]), or to determine the 
mixing time in the uniform metric. 

For 7j2 I 67, already [11] implies a total variation cutoff with threshold 
\t cov {K n ) for G being the complete graph and that there is no cutoff if G 
is a cycle of length n. The results of [ ] include a proof of total variation 
cutoff for Z2 I with threshold i cov (Z^). The results in [16] also includes 
cutoff at l/2t cov (G n ), with some uniform local transience assumptions on 
G n . Further, Levi [14] proved that the wreath product of two complete 
graphs K n x I K n , < A < 1 exhibits a cutoff at (1 + A)/2nlogn. 

For the mixing time in the uniform metric, we know [18, Theorem 1.4] 
that if G is a regular graph such that thit(G') < i^|6?|, then there exists 
constants c, C depending only on K such that 

(6.1) c\G\(t Iel (G) + log \G\) < t u (Z 2 l G) < C\G\(t mix (G) + log 

These bounds fail to match in general. For example, for the hypercube Z2, 
t re i(Zg) = Q(d) [13, Example 12.15] while t mix (Z$) = 8(dlogd) [13, Theo- 
rem 18.3]. Then [12] showed that the lower bound is sharp in (6.1) under 
conditions which are satisfied by the d(n) dimension tori G n = Z^ n ^ for 
arbitrary chosen n and d(n). 

7. Acknowledgement. We thank Elisabetta Candellero, Gabor Pete 
and Tim Kam Wong for useful comments and Louigi Addario-Berry, Perla 
Sousi and Peter Winkler for a useful discussion of halting states. 

References. 

[1] David Aldous and Persi Diaconis. Shuffling cards and stopping times. Amer. Math. 

Monthly, 93:243-297, 1986. 
[2] David Aldous and James Allen Fill. Reversible Markov Chains and Random Walks 

on Graphs. University of California, Berkeley, 2002. 
[3] Martin T. Barlow, Jian Ding, Asaf Nachmias, and Yuval Peres. The evolution of the 

cover time. Combm. Probab. Comput., 20(3):331-345, 2011. 
[4] M.J. A. M. Brummelhuis and H. J. Hilhorst. Covering of a finite lattice by a random 

walk. Physica A., 176:387-408, 1991. 
[5] A. Dembo, Y. Peres, J. Rosen, and O. Zeitouni. Cover times for Brownian motion 

and random walks in two dimensions. Ann. Math., 160(2):433-464, 2004. 
[6] A. Dembo, Y. Peres, J. Rosen, and O. Zeitouni. Late points for random walk in two 

dimensions. Ann. Probab., 34:219-263, 2006. 
[7] Persi Diaconis and James Allen Fill. Strong stationary times via a new form of 

duality. Ann. Probab., 18(4):1483-1522, 1990. 
[8] Jian Ding, Yuval Peres, and James R. Lee. Cover times, blanket times, and majorizing 

measures. Annals of Mathematics, 175(3):1409-1471, 2012. 



30 



JULIA KOMJATHY AND YUVAL PERES 



[9] Uriel Feige. A tight lower bound on the cover time for random walks on graphs. 

Random Structures and Algorithms, 6(4):433-438, 1995. 
[10] James Fill and Jr. Clyde Schoolfield. Mixing times for markov chains on wreath 

products and related homogeneous spaces. Electron. J. Probab., 6:no. 11, 1-22, 2001. 
[11] O. Haggstrom and J. Jonasson. Rates of convergence of lamplighter processes. 

Stochastic Processes and their Applications, 67:227-249, 1997. 
[12] J. Komjathy, J. P. Miller, and Y. Peres. Uniform mixing time for random walk on 

lamplighter graphs, http://arxiv.org/pdf/1109.4281.pdf. 
[13] D. Levin, Y. Peres, and E. Wilmer. Markov Chains and Mixing Times. American 

Mathematical Society, 2008. 
[14] Nathan Levy. Mixing time for lamplighter graphs. Master's thesis, University of 

California, Berkeley, 2006. 
[15] Laszlo Lovasz and Peter Winkler. Efficient stopping rules for markov chains. In 

Proceedings of the twenty-seventh annual ACM symposium on Theory of computing, 

STOC '95, pages 76-82, New York, NY, USA, 1995. ACM. 
[16] Jason P. Miller and Yuval Peres. Uniformity of the uncovered set of random walk 

and cutoff for lamplighter chains. Annals of Probability, 2011. 
[17] Igor Pak. Random Walks on Groups: Strong Uniform Time Approach. PhD thesis, 

Harvard University, 1997. 
[18] Yuval Peres and David Revelle. Mixing times for random walks on finite lamplighter 

groups. Electronic Journal of Probability, 9:825-845, 2004. 
[19] Yuval Peres and Perla Sousi. Mixing times are hitting times of large sets. 

http://arxiv.org/abs/1108.0133. 
[20] Peter Winkler and David Zuckerman. Multiple cover time. Random Structures Al- 
gorithms, 9(4):403-411, 1996. 



Budapest University of Technology and Economics, komyjuamath.bme.hu Microsoft Research peresamicrosoft.com 



